Conquering Memory Bandwidth Challenges in High-Performance SoCs
ثبت نشده
چکیده
High end System‐on‐Chip (SoC) architectures consist of tens of processing engines. In SoCs targeted at high performance computing such as HDTV chips, the communication bandwidth is dominated by accesses to the off‐chip DRAM. Limited available bandwidth at the DRAM memory continues to be one of the most important constraints driving the design of these SoCs. Newer DRAM memories such as DDR‐3 impose a minimum access granularity that can lead to memory wastage when configured to attain the bandwidth requirements of the application. In this paper, we introduce and highlight multi‐channel DRAM memories in high‐performance SoCs as a solution for the memory bandwidth problem, and present our communications architecture optimized for the same. We will address deadlock scenarios associated with multi‐channel memories and provide mechanisms to avoid them. We compare our architecture with an industrial interconnect by experimenting with representative high definition digital TV applications. Our results indicate an average of 15% throughput improvement over the commercial interconnect solution.
منابع مشابه
Evaluation of mobile ARM-based SoCs for high performance computing
The electrical power consumption of current supercomputers has become the key limiting factor to successfully reach exascale. To further increase energy efficiency, the use of mobile system-on-chips (SoCs) has been proposed in the past. In this work we investigate four development boards equipped with different SoCs with respect to their suitability for HPC. Each of the boards is carefully benc...
متن کاملA Trace-based Workflow for Evaluating Application-specific Memory Bandwidth for FPGA-SoCs
FPGA-SoCs such as Xilinx’s Zynq-7000 and Altera’s Cyclone V SoC provide a great platform for HW/SW-Codesigns. These devices combine a powerful embedded processor with programmable logic similar to that found in FPGAs. For the communication between both parts, FPGA-SoCs provide various interfaces in the logic component that offer access to the DDR memory used by the processor. While high through...
متن کاملPipelined Bidirectional Bus Architecture for Embedded Multimedia SoCs
This paper proposes novel high-performance bus architecture for memory-intensive embedded multimedia SoCs. It has a pipelined bidirectional bus for high speed and small area. It has two separate bus called system bus and memory bus, where memory-intensive IPs are connected to memory bus so not to degrade system bus performance. To avoid starvation of low-priority masters, the proposed bus explo...
متن کاملHydra: An Accelerator for Real-Time Edge-Aware Permeability Filtering in 65nm CMOS
Many modern video processing pipelines rely on edge-aware (EA) filtering methods. However, recent high-quality methods are challenging to run in real-time on embedded hardware due to their computational load. To this end, we propose an area-efficient and real-time capable hardware implementation of a high quality EA method. In particular, we focus on the recently proposed permeability filter (P...
متن کاملMapping of H.264 decoding on a multiprocessor architecture
Due to the increasing significance of development costs in the competitive domain of high-volume consumer electronics, generic solutions are required to enable reuse of the design effort and to increase the potential market volume. As a result from this, Systems-on-Chip (SoCs) contain a growing amount of fully programmable media processing devices as opposed to application-specific systems, whi...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2012